WHAT IS CLAIMED IS: 

1. An information processing apparatus comprising: 
a reader, arranged to read a document image; 

a recognition section, arranged to recognize 
5 character strings of the read document image; 

a extractor, arranged to extract a character 
string indicating contents of a document from the 
recognized character strings; and 

a synthesizer, arranged to synthesize and output 
10 speech based on the chosen character string. 

2. The apparatus according to claim 1, wherein said 
extractor extracts the character string indicating the 
contents of the document on the basis of the recognized 
character string, a location of the character string on 

15 the document image, a character size and color, and 
vertical or horizontal writing. 

3 . The apparatus according to claim 1 , wherein said 
extractor extracts a title of each page of the 
document . 

20 4. An information processing apparatus comprising: 
a reader, arranged to read a document image; 
a recognition section, arranged to recognize 
character strings of the read document image; 

an extractor, arranged to extract character 
25 strings indicating pages of documents from the 
recognized character strings; 

a determiner, arranged to determine a page order 
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of documents and/or omission of a page on the basis of 
the extracted character strings ; and 

an output section, arranged to output speech 
corresponding to a determination result. 
5 5 . The apparatus according to claim 4 , wherein said 
output section comprises a synthesizer arranged to 
synthesize speech that represents the determination 
result . 

6. An information processing apparatus comprising: 
10 a reader, arranged to read a document image ,- 

a recognition section, arranged to recognize 
character strings of the read document image; 

an extractor, arranged to extract a character 
string, which represents the number of pages of 
15 documents, from character strings recognized from a 
first page of a document; 

a counter, arranged to count the number of pages 
of documents; 

a comparator, arranged to compare the extracted 
20 character string and a count value of said counter; and 

an output section, arranged to output speech 
corresponding to a comparison result. 

7. The apparatus according to claim 6, wherein said 
output section comprises a synthesizer arranged to 

25 synthesize speech that represents the comparison 
result . 

8. An information processing method comprising steps 
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Of: 

reading a document image; 

recognizing character strings of the read 
document image; 
5 extracting a character string indicating contents 

of a document from the recognized character strings; 
and 

synthesizing and outputting speech based on the 
chosen character string. 
10 9. An information processing method comprising steps 
of: 

reading a document image; 

recognizing character strings of the read 
document image ; 
15 extracting character strings indicating pages of 

documents from the recognized character strings; 

determining a page order of documents and/or 
omission of a page on the basis of the extracted 
character strings; and 
20 outputting speech corresponding to a 

determination result. 

10. An information processing method comprising steps 
of: 

reading a document image; 
25 recognizing character strings of the read 

document image ; 

extracting a character string, which represents 
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the number of pages of documents, from character 
strings recognized from a first page of a document; 

counting the number of pages of documents; 

comparing the extracted character string and 
5 count value; and 

outputting speech corresponding to a comparison 
result . 

11. A computer program product storing a computer 
readable medium comprising a computer program code, for 
10 an information processing method, the method comprising 
steps of: 

reading a document image; 

recognizing character strings of the read 
document image; 

15 extracting a character string indicating contents 

of a document from the recognized character strings; 
and 

synthesizing and outputting speech based on the 
chosen character string. 
20 12. A computer program product storing a computer 

readable medium comprising a computer program code, for 
an information processing method, the method comprising 
steps of: 

reading a document image; 
25 recognizing character strings of the read 

document image; 

extracting character strings indicating pages of 
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documents from the recognized character strings; 

determining a page order of documents and/or 
omission of a page on the basis of the extracted 
character strings; and 
5 outputting speech corresponding to a 

determination result - 

13. A computer program product storing a computer 
readable medium comprising a computer program code, for 
an information processing method, the method comprising 
10 steps of: 

reading a document image; 

recognizing character strings of the read 
document image; 

extracting a character string, which represents 
15 the number of pages of documents, from character 

strings recognized from a first page of a document; 
counting the number of pages of documents; 
comparing the extracted character string and 
count value; and 
20 outputting speech corresponding to a comparison 

result . 
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